Abstract

We prove that termination is undecidable for non-length-increasing string rewriting systems,
using linear-bounded automata. On the other hand, we prove the undecidability of confluence for terminating
rewriting systems when terms begin with a fixed symbol. These two results illustrate that restricting a
problem to recognizable domains sometimes modifies its decidability properties and sometimes does not. (We only
consider finite terms.)


Introduction 


Through two problems, we show the influence of initial configurations on the
decision properties of rewriting. The first problem concerns termination, and the second, confluence.

Termination problems are fundamental in rewriting because they correspond to
program termination for all data [Dershowitz & Jouannaud]. Many termination criteria have
been studied [Dershowitz] but, in general, termination is undecidable, even for one left-linear
rule [Dauchet] or for a semi-Thue system [Huet & Lankford]. Termination problems for one linear
rule or for one rule on words remain open. In this last case, however, if the rule is
non-length-increasing, termination is clearly decidable.

Here, we prove the undecidability of termination of non-length-increasing string rewriting
systems (i.e. non-length-increasing semi-Thue systems). This problem is similar to
linear-bounded automata termination [Book] and has been stated in the case of graphs by Litovsky
and Metivier [Litovsky & Metivier]. Therefore we revisit a paper of Hooper [Hooper], in which
he studied termination of Turing machines and proved that termination is undecidable for
linear-bounded automata and, more generally, for Turing machines. Using techniques suggested by
Hooper, we prove directly the undecidability of termination for non-length-increasing string
rewriting systems.

In the first part, we construct a class of linear-bounded automata whose termination
problem encodes the Post correspondence problem. This result is well known, but we use our
construction in the second part, and observe that undecidability persists if we drop the
constraint of starting from an initial configuration. In the third part, we want to bring out
one fact: the link between decidability and recognizable restrictions on terms. A recognizable
restriction means that terms belong to a recognizable language. In contrast to the
first result, we prove that confluence for terminating rewriting systems becomes undecidable if
we restrict terms to some recognizable set. (It is well known that confluence is
decidable for noetherian rewriting systems [Newman].) Confluence on recognizable tree
languages is interesting because these languages are sorts (the finite automaton being the
signature). Note that Otto proves that confluence on some congruence class is undecidable
[Otto], but congruence classes are generally not recognizable.


I - Termination of linear-bounded automata


Linear-bounded automata were introduced by Myhill [Myhill] and have been widely studied since
[Kuroda]. In particular, Hooper studied the undecidability of termination of Turing machines and
linear-bounded automata [Hooper]. He called this problem the immortality problem. Moreover,
Hopcroft and Ullman showed that to every linear-bounded automaton we can associate an
equivalent terminating linear-bounded automaton [Hopcroft & Ullman].

In this part, we prove directly the undecidability of termination for a class of linear-bounded
automata which restore their initial configuration when they do not stop, using a construction
suited to the more general result of the second part.


Definition 1-1 : A machine terminates if and only if it stops for all data. 


Definition 1-2 : A linear-bounded automaton (LBA) can be seen as a particular Turing
machine. Its tape is an input/output tape whose length depends linearly on the data length. An
LBA is a sextuple (Σ, Γ, Q, Q0, Qf, Δ), where Σ is the data alphabet, Γ the work alphabet, Q the
set of states, Q0 the set of initial states, Qf the set of final states, and Δ the next-move function. We suppose
that the tape has the form #<d>#, where #, <, > are never modified and d is the data.


Vocabulary : We use the notions of Turing machines: instantaneous description, initial
configuration, computation step, computation. More precisely:

- an instantaneous description (denoted ID) is a word #<m1 q a m2>#. It means that the head is
reading the letter a, the word m1 is on the left and m2 is on the right, and q is the machine state.

- an initial configuration is an instantaneous description #<q m># where q is an initial state.

- a computation step ID1 → ID2 means we can go from ID1 to ID2 with a transition of Δ.

- a computation is a succession of computation steps from an initial configuration ID1 to a final
configuration IDf.

We are not interested in the result but in whether the computation stops. Therefore, the final
configuration is not important. But we need two notions: beginning of computation and sub-computation:

- a beginning of computation is a succession of computation steps from an initial configuration
ID1.

- IDi → ... → IDj is a sub-computation if there exists a beginning of computation
ID1 → ... → IDi → ... → IDj which contains it.


Proposition 1-1 : [Hooper] Termination of linear-bounded automata is undecidable.

Post correspondence problem : [Post]

The Post correspondence problem P(φ,ψ) over an alphabet X is given by two morphisms φ and
ψ from I* to X*. P(φ,ψ) is solvable if and only if there exists a non-empty word m1 in I* such that φ(m1) = ψ(m1).
The Post correspondence problem is well known to be undecidable.
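
The definition can be made concrete with a short sketch. It assumes that a morphism is given letter by letter as a Python dictionary; the function names are ours and purely illustrative. The bounded search is only a semi-decision procedure: finding no solution up to a given length proves nothing about longer words, which is consistent with the undecidability of the problem. The morphisms used are those of Example 1 of this part.

```python
from itertools import product

def image(morphism, word):
    """Apply a morphism (given letter by letter as a dict) to a word."""
    return "".join(morphism[letter] for letter in word)

def is_solution(phi, psi, word):
    """True when the non-empty word is a solution of P(phi, psi)."""
    return len(word) > 0 and image(phi, word) == image(psi, word)

def bounded_search(phi, psi, max_length):
    """Return the first solution of length <= max_length, or None.

    A None result says nothing about words longer than max_length.
    """
    letters = sorted(phi)
    for length in range(1, max_length + 1):
        for candidate in product(letters, repeat=length):
            word = "".join(candidate)
            if is_solution(phi, psi, word):
                return word
    return None

# Morphisms of Example 1 (I = {a, b}, X = {1, 2, 3}).
phi = {"a": "123", "b": "32"}
psi = {"a": "23", "b": "321"}
```

With these morphisms, the word ba is a solution, since φ(ba) = 32·123 and ψ(ba) = 321·23 both equal 32123.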


We work with a specific class of deterministic LBA, denoted A_post, associated to the Post
problem. A machine A_φψ of this class is associated to two morphisms φ and ψ. If the tape of
A_φψ contains two words m1 and m2 such that m2 = φ(m1) = ψ(m1), then the machine loops,
passing through its initial configuration again.


Definition 1-3 : A_post is a set of LBA associated to the Post problem.

A_post = { A_φψ = (Σ, Γ, Q_φψ, {q0}, ∅, Δ) | φ and ψ two morphisms from I* to X* }, with
. Σ = I ∪ X ∪ {<,>}, I and X two disjoint finite alphabets.

. Γ = Σ ∪ Ī ∪ X̄, where Ī and X̄ are constructed from I and X: for all x in X, x̄ is in X̄ and for all i in I, ī is in Ī.
. Q_φψ is the set of the states of the automaton A_φψ.

Appendix I gives the next-move function Δ of A_φψ in detail. The following short program
and the two examples explain the behaviour of a machine A_φψ.


Program A_φψ ;
* The data is a word with letters of I and X. The head is on the first letter *
stop := false ;
while not stop do
  * search for the rightmost letter of I *
  * I(Head) means that the head reads a letter of I *
  while I(Head) do move := right ; od
  * we verify that m = i_p...i_2 i_1 x_1 x_2...x_m, with φ(i_1 i_2...i_p) = x_1 x_2...x_m *
  while not (alloverlined(tape) or stop) do
    i := Head ; overline(Head) ;
    * we search for the first non-overlined letter of X, on the right *
    while Ioverlined(Head) do move := right ; od
    while Xoverlined(Head) do move := right ; od
    * now the tape has the form i_p...i_{j+1} ī_j...ī_1 x̄_1...x̄_{k-1} x_k...x_m *
    * we verify that x_k...x_{k+p} = φ(i) and if it is true, we overline x_k...x_{k+p} *
    researchφ(i) ;
    * stop = true if φ(i) is not on the right-hand side *
    * we now search for the first non-overlined letter of I, on the left *
    while Xoverlined(Head) do move := left ; od
    while Ioverlined(Head) do move := left ; od
  od
  * we do exactly the same thing with ψ *
  while not (alloverlined(tape) or stop) do
    ...
  od
od
end
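
The check performed by one turn of the outer loop can be sketched at a higher level than the tape. The sketch below is not the machine itself: it ignores head moves and states, assumes that the data is a Python string over I ∪ X, and assumes that the block of letters of I must be non-empty. The function names are ours. It returns whether the machine would come back to its initial configuration (and therefore loop) or stop.

```python
def overlining_pass(data, morphism, I):
    """One verification pass: True when every letter ends up overlined."""
    # Split the data into its leading block of letters of I and the rest.
    k = 0
    while k < len(data) and data[k] in I:
        k += 1
    i_part, x_part = data[:k], data[k:]
    if k == 0 or any(letter in I for letter in x_part):
        return False  # not of the form (letters of I)(letters of X)
    overlined = 0  # number of letters of x_part overlined so far
    for i in reversed(i_part):  # rightmost letter of I first
        expected = morphism[i]
        if x_part[overlined:overlined + len(expected)] != expected:
            return False  # plays the role of stop := true
        overlined += len(expected)
    return overlined == len(x_part)  # plays the role of alloverlined(tape)

def loops(data, phi, psi):
    """True when both passes (with phi, then with psi) succeed."""
    I = set(phi)
    return overlining_pass(data, phi, I) and overlining_pass(data, psi, I)

# Morphisms of the two examples of this part.
phi = {"a": "123", "b": "32"}
psi = {"a": "23", "b": "321"}
```

On the data ab32123 both passes succeed, so the machine loops; on the data a132 the first pass fails on the second letter of φ(a), so the machine stops.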


Example 1:

I = {a,b} ; X = {1,2,3} ;

φ(a) = 123 ; φ(b) = 32 ; ψ(a) = 23 ; ψ(b) = 321.

- The tape contains the data ab32123. So the initial configuration is #<q0 ab32123>#.

- The machine searches for the last non-overlined letter of I. It is b.

The configuration is now #<a q2 b32123>#.

- The machine overlines b. The configuration is #<ab̄ q_φ(b)1 32123>#.

- The machine verifies that 3 is the first letter of φ(b). Since it is true, it overlines it.
The configuration is #<ab̄3̄ q_φ(b)2 2123>#.

- It verifies that 2 is the second letter of φ(b) and overlines it. Since φ(b) contains only two
letters, the machine searches for the rightmost non-overlined letter of I. It is a.

The configuration is now #<q_ret ab̄3̄2̄123>#.

- The machine overlines a. The configuration is #<ā q_φ(a)1 b̄3̄2̄123>#.

- It searches for the first non-overlined letter of X. It is 1.

The configuration is #<āb̄3̄2̄ q_φ(a)1 123>#.

- It verifies that 1 is the first letter of φ(a). Since it is true, it overlines it.

The configuration is #<āb̄3̄2̄1̄ q_φ(a)2 23>#.

- In the same way, the automaton overlines 2 and 3, the second and the third letters of φ(a).
The configuration is #q_ret<āb̄3̄2̄1̄2̄3̄>#.

- There is no more non-overlined letter of I. So the automaton replaces the overlined letters by
the same non-overlined letters, verifying that all the data is overlined.

The configuration is #<ab32123 q_reset >#.

- The head goes to the rightmost letter of I. It is b.

The configuration is #<a q_again b32123>#.

- As before, the machine overlines b and searches for the image of b by ψ. It is 321.

The configuration is #<q'_ret ab̄3̄2̄1̄23>#.

- The machine overlines a and searches for the image of a by ψ. It is 23.

The configuration is now #q'_ret<āb̄3̄2̄1̄2̄3̄>#.

- There is no more non-overlined letter of I. So the automaton replaces the overlined letters by
the same non-overlined letters, verifying that all the tape is overlined.

The configuration is #<ab32123 q'_reset >#.

- Now, the machine has verified that the data had the form #<m̃m'># with φ(m) = ψ(m) = m'.
m̃ represents the mirror image of the word m.

- The initial configuration is restored when the Post correspondence problem P(φ,ψ) is satisfied.
The configuration is #<q0 ab32123>#.


Example 2:

The tape contains the word a132. The initial configuration is #<q0 a132>#.

- The machine overlines a. The configuration is #<ā q_φ(a)1 132>#.

- It verifies that 1 is the first letter of φ(a) and overlines it.

The configuration is now #<ā1̄ q_φ(a)2 32>#.

- It verifies that 3 is the second letter of φ(a). Since it is false, the machine stops.


Definition 1-4 : An initial configuration is proper if and only if it has the form #<q0 m̃1 m2>#
(m̃1 denotes the mirror image of m1), with m2 a word in X*, m1 a word in I+, and q0 the initial
state of the machine.

Lemma 1-1:
For all machines of the class A_post, if the initial configuration is not proper then the machine
stops.


Proof : From the definition of the machine. □


Lemma 1-2:
For all machines in A_post starting from a proper initial configuration #<q0 m̃1 m2>#, the
machine loops, passing through its initial configuration again, if and only if m2 = φ(m1) = ψ(m1).


Proof : From the definition of the machine. □


We get the next proposition as a corollary of these lemmas.

Proposition 1-2:

(1) Termination is undecidable for the class A_post.

(2) If an automaton A_φψ loops for a data d then it passes through its initial configuration again.


Proof:

(1) According to lemmas 1-1 and 1-2, A_φψ loops if and only if m2 = φ(m1) = ψ(m1). But it is
not decidable whether there exists m1 such that φ(m1) = ψ(m1) (Post correspondence problem). Therefore,
termination of the class A_post is undecidable.

(2) From lemma 1-2, if the machine loops then it passes through its initial configuration again. □


Lemma 1-3: 

If there exists a computation which loops from some configuration (not necessarily reachable) 
then there exists a beginning of computation starting from a proper initial configuration which 
loops. 


Proof : See appendix II. 


II - Termination of non-length-increasing string rewriting systems 


We want to prove that termination of non-length-increasing string rewriting systems is 
undecidable. 


Definition 2-1 : A non-length-increasing string rewriting system is a system where rules have
the form l → r with |l| ≥ |r|, l and r being words.


A particular class of non-length-increasing string rewriting systems is the class of
length-preserving string rewriting systems.

Definition 2-2 : A length-preserving string rewriting system is a system where rules have the
form l → r with |l| = |r|, l and r being words.

We construct a class R_post of rewriting systems associated to the class A_post of machines studied
before: I and X are the two alphabets considered in paragraph I. Σ = {#,<,>} ∪ I ∪ X. We
construct I' and I'' from I, X' and X'' from X, and Σ', Σ'' from Σ: ∀a ∈ Σ, a' ∈ Σ' and a'' ∈ Σ''.
Q is a finite alphabet, disjoint from Σ.
From a machine A_φψ of the class A_post, we construct the rewriting system R_φψ.
For all transitions (q1,a1) →_A (a2,q2,Ri), for all a,b in Σ, we construct the rule:

a' q1 a1'' b'' →_R a' a2' q2 b''
and for all transitions (q1,a1) →_A (a2,q2,Le), for all a,b in Σ, we construct the rule:

a' b' q1 a1'' →_R a' q2 b'' a2''.
(Intuitively, x' is a letter on the left side of the tape head, and x'' a letter on the right side.)


R_post is a class of length-preserving string rewriting systems.
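
The construction of the rules can be sketched as follows. The encoding is an assumption made for illustration: a letter a' is the string a followed by a single quote, a letter a'' is a followed by a double quote, a word is a tuple of such letters, and transitions are stored in a dictionary. The two-transition machine at the end is hypothetical and is not the machine of Appendix I.

```python
from itertools import product

def left(a):
    """Copy a' of the letter a (a letter on the left of the head)."""
    return a + "'"

def right(a):
    """Copy a'' of the letter a (the letter read, or a letter on its right)."""
    return a + '"'

def rules_for(transitions, sigma):
    """Build the rules from transitions (q1, a1) -> (a2, q2, direction)."""
    rules = []
    for (q1, a1), (a2, q2, direction) in transitions.items():
        for a, b in product(sorted(sigma), repeat=2):
            if direction == "Ri":
                lhs = (left(a), q1, right(a1), right(b))
                rhs = (left(a), left(a2), q2, right(b))
            else:  # direction == "Le"
                lhs = (left(a), left(b), q1, right(a1))
                rhs = (left(a), q2, right(b), right(a2))
            rules.append((lhs, rhs))
    return rules

def rewrite_once(word, rules):
    """Return all words obtained from `word` (a tuple of letters) in one step."""
    results = []
    for lhs, rhs in rules:
        for k in range(len(word) - len(lhs) + 1):
            if word[k:k + len(lhs)] == lhs:
                results.append(word[:k] + rhs + word[k + len(lhs):])
    return results

# Hypothetical machine with two transitions, used only as an illustration.
sigma = {"#", "0", "1"}
transitions = {("p", "0"): ("1", "p", "Ri"), ("p", "1"): ("0", "r", "Le")}
rules = rules_for(transitions, sigma)
```

Every rule built this way has a left-hand side and a right-hand side of length 4, which is why the system is length-preserving.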
Notation : to simplify, we write A for A_φψ and R for the associated system R_φψ.
→*_R is the transitive reflexive closure of →_R.


Lemma 2-1 :

For all m in (Σ' ∪ Σ'' ∪ Q)*,

- either m ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)*

- or m can be written in one way m_1 a_1'' u_1 b_1' m_2 a_2'' u_2 b_2' ... m_n a_n'' u_n b_n' m_{n+1} with n > 0,
m_1, ..., m_n, m_{n+1} ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)*, u_1, ..., u_n ∈ Q*, a_1'', ..., a_n'' ∈ Σ'', b_1', ..., b_n' ∈ Σ'.


Proof : By induction on the length of m.


Corollary 2-1 :

Every word m of (Σ' ∪ Σ'' ∪ Q)* - (Σ' ∪ Q)*.(Σ'' ∪ Q)* can be written in one way w_1 w_2 ... w_{n+1}
with w_i ∈ Σ'.(Σ' ∪ Q)*.(Σ'' ∪ Q)*.Σ'' for 1 < i < n+1,

and w_1, w_{n+1} ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)*.


Proof :
In the decomposition given by lemma 2-1, we take w_1 = m_1 a_1'' u_1, w_{i+1} = b_i' m_{i+1} a_{i+1}'' u_{i+1} for
1 ≤ i < n and w_{n+1} = b_n' m_{n+1}. □


Definition 2-3 : Let m be a word of (Σ' ∪ Σ'' ∪ Q)*. We define the signature of m by

- If m ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)* then its signature is the empty set.

- Else, m can be written in one way m_1 a_1'' u_1 b_1' ... m_n a_n'' u_n b_n' m_{n+1} and its signature is the set
of occurrences of the letters a_i''. An occurrence is denoted by the length of the shortest prefix of m
containing this occurrence.


We remark that the signature marks the occurrences of letters of Σ'' whose successor
is in Σ', without consideration of the states of Q.


Examples :
X = {a,b,c,d,e,f,g} . m = acfgc. The occurrence of the first letter c in m is 2, the occurrence of the
second letter c in m is 5.

Σ = {0,1}, Q = {q}

m = 0'1'0'q0'q0''1'' . sign(m) = Ø

m = 0'1'0''q0''q0'1'q0'0''1' . sign(m) = {5,11}.
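
The signature can be computed directly from the remark above. The sketch below assumes an encoding of our own: a word is a list of strings, a letter of Σ' ends with a single quote, a letter of Σ'' ends with a double quote, and the states are given as a set. The two sample words are the two words over Σ = {0,1} and Q = {q} of the examples, as we read them.

```python
def kind(letter, states):
    """Classify a letter: 'Q' for a state, otherwise its final mark (' or ")."""
    if letter in states:
        return "Q"
    return letter[-1]

def signature(word, states):
    """1-indexed positions of the letters of Sigma'' whose next non-state letter is in Sigma'."""
    positions = set()
    for p, letter in enumerate(word, start=1):
        if kind(letter, states) != '"':
            continue
        # word[p] is the letter following position p; skip the states.
        k = p
        while k < len(word) and kind(word[k], states) == "Q":
            k += 1
        if k < len(word) and kind(word[k], states) == "'":
            positions.add(p)
    return positions

states = {"q"}
m_first = ["0'", "1'", "0'", "q", "0'", "q", '0"', '1"']
m_second = ["0'", "1'", '0"', "q", '0"', "q", "0'", "1'", "q", "0'", '0"', "1'"]
```

For the first word no letter of Σ'' is followed by a letter of Σ', so the signature is empty; for the second word the function returns the positions 5 and 11.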


Lemma 2-2 :
The size and the signature of a word m are unchanged when m is reduced with rules of R.


Proof:

Let m be a word of (Σ' ∪ Σ'' ∪ Q)*.

1st case: m ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)*.

Then sign(m) = Ø. We apply to m a rule of R.

a) the rule has the form a' q1 a1'' b'' →_R a' a2' q2 b'', a and b some letters of Σ. This rule can be
applied only if m is written m_1 a' q1 a1'' b'' m_2 with m_1 ∈ (Σ' ∪ Q)* and m_2 ∈ (Σ'' ∪ Q)*. Then,
applying the rule, we get the word m' = m_1 a' a2' q2 b'' m_2. Hence |m| = |m'| = |m_1| + |m_2| + 4.
Moreover, m' ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)*, therefore sign(m') = Ø = sign(m).

b) the rule has the form a' b' q1 a1'' →_R a' q2 b'' a2'', a and b some letters of Σ. This rule can be
applied only if m is written m_1 a' b' q1 a1'' m_2 with m_1 ∈ (Σ' ∪ Q)* and m_2 ∈ (Σ'' ∪ Q)*.
Applying this rule, we get the word m' = m_1 a' q2 b'' a2'' m_2. Hence |m| = |m'| = |m_1| + |m_2| + 4.
Moreover, m' ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)*, therefore sign(m') = Ø = sign(m).

2nd case: m = m_1 a_1'' u_1 b_1' ... m_n a_n'' u_n b_n' m_{n+1}.

We apply one rule of R to m. A rule can be applied only on a subword of m in (Σ' ∪ Q)*.(Σ'' ∪
Q)*. But we have just shown that for m ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)*, rewriting preserves signature
and size. Hence this result is true for each m in (Σ' ∪ Σ'' ∪ Q)*. □


Lemma 2-3:
For every word m in (Σ' ∪ Σ'' ∪ Q)* - (Σ' ∪ Q)*.(Σ'' ∪ Q)* written w_1 w_2 ... w_{n+1},
(m →*_R t) ⇔ (t = t_1 t_2 ... t_{n+1} such that w_i →*_R t_i, i = 1,...,n+1)


Proof :

⇒) Let m = w_1 w_2 ... w_{n+1}. We have shown with lemma 2-2 that rules of R can be applied only
inside the w_i. Hence (m →*_R t) ⇒ (t = t_1 t_2 ... t_{n+1} such that w_i →*_R t_i, i = 1,...,n+1).

⇐) If t = t_1 t_2 ... t_{n+1} with w_i →*_R t_i (i = 1,...,n+1) then w_1 w_2 ... w_{n+1} →*_R t_1 t_2 ... t_{n+1}.

Therefore m →*_R t. □


Definition 2-4 :
Let m be a word of the form a' v' q w'' b'' with q ∈ Q, v' ∈ Σ'*, w'' ∈ Σ''*, a' ∈ Σ' and b'' ∈ Σ''. We
associate to m the configuration of the machine A: C(m) = a v q w b.


Remark: a and b are never read by the machine and mark the ends of the tape (instead of #).


Lemma 2-4 :
For m ∈ Σ'+.Q.Σ''+, m →_R m' ⇔ C(m) →_A C(m').

Proof:

⇒) To the word m, we associate the configuration C(m). The rules of R have the following two
forms:

(r1) a' q1 a1'' b'' →_R a' a2' q2 b'', for some a and b in Σ.

(r2) a' b' q1 a1'' →_R a' q2 b'' a2'', for some a and b in Σ.

1st case : rule (r1).

To apply (r1) on m, m should be of the form v' a' q1 a1'' b'' w'', v' ∈ Σ'*, w'' ∈ Σ''*. Then C(m) =
v a q1 a1 b w. We know that m -(r1)→_R m' ⇔ m' = v' a' a2' q2 b'' w''. So C(m') = v a a2 q2 b w. But
(r1) has been constructed from the transition (q1,a1) →_A (a2,q2,Ri). Hence C(m) →_A C(m').

2nd case : rule (r2).

To apply (r2) on m, m should be of the form v' a' b' q1 a1'' w'', v' ∈ Σ'* and w'' ∈ Σ''*. Then C(m)
= v a b q1 a1 w. Since m -(r2)→_R m' ⇔ m' = v' a' q2 b'' a2'' w'', C(m') = v a q2 b a2 w. But (r2) has
been constructed from the transition (q1,a1) →_A (a2,q2,Le). Hence C(m) →_A C(m').

⇐) Suppose that m and m' are such that C(m) →_A C(m'). Transitions of A are of the following
two forms:

(t1) (q1,a1) →_A (a2,q2,Ri).

(t2) (q1,a1) →_A (a2,q2,Le).

C(m) = α σ q1 a1 ω β with α,β ∈ Σ and σ,ω ∈ Σ*. On C(m), we can apply (t1) or (t2).

1st case: transition (t1).

To every transition (t1) and for all a,b in Σ we associate the rule (r1) : a' q1 a1'' b'' →_R a' a2' q2 b''.
The word m associated to the configuration C(m) is α' σ' q1 a1'' ω'' β''. The word m' such that C(m)
-(t1)→_A C(m') is α' σ' a2' q2 ω'' β''. Therefore m' = v' a' a2' q2 b'' w'' supposing v' a' = α' σ' and
b'' w'' = ω'' β''. Then m = v' a' q1 a1'' b'' w''. Hence m →_R m'.

2nd case : transition (t2).

To every transition (t2) and for all a,b in Σ we associate the rule (r2) : a' b' q1 a1'' →_R a' q2 b'' a2''.
The word m associated to C(m) is α' σ' q1 a1'' ω'' β'' (as in the first case). m is written
v' a' q1 a1'' b'' w'' and the word m' such that C(m) -(t2)→_A C(m') is written v' q2 a'' a2'' b'' w''.
Since C(m) is a configuration, v' contains at least one letter. Therefore we can apply a rule (r2)
and m →_R m'. □


Lemma 2-5 :
The machine A does not terminate ⇒ The rewriting system R does not terminate.


Proof : Suppose that the machine A does not terminate. Then, starting from an initial
configuration C_0, there exists an infinite computation C_0 →_A C_1 →_A ... . But for every
configuration C_i, there exists a word m_i in Σ'*.Q.Σ''* such that C_i = C(m_i). Hence there exist
m_0, m_1, ... in Σ'*.Q.Σ''* such that C(m_0) →_A C(m_1) →_A ... . According to lemma 2-4, C(m_0)
→_A C(m_1) →_A ... ⇔ m_0 →_R m_1 →_R ... . Therefore R does not terminate. □


Lemma 2-6 :
The rewriting system R does not terminate ⇒ there exists an infinite sub-computation C_1 →_A C_2
→_A ... .


Proof : We suppose that R does not terminate.
Hence there exists an infinite derivation m →_R ... . From lemma 2-1, either m ∈ (Σ' ∪ Q)*.(Σ''
∪ Q)*, or m is written m_1 a_1'' u_1 b_1' ... m_{n+1}.

1) if m ∈ (Σ' ∪ Q)*.(Σ'' ∪ Q)*

On m, we can apply a rule of R. Therefore m = v_1 σ v_2 with v_1 ∈ (Σ' ∪ Q)*.Q or v_1 ∈ Q*, v_2
∈ Q.(Σ'' ∪ Q)* or v_2 ∈ Q*, and σ ∈ Σ'+.Q.Σ''+. To σ we associate the configuration of A: C(σ).
We cannot apply a rule on v_1 and v_2, so we apply it on σ. Hence σ →_R ... . According to lemma
2-4, C(σ) →_A ... .

2) else m = m_1 a_1'' u_1 b_1' m_2 a_2'' u_2 b_2' ... m_{n+1} = w_1 w_2 ... w_{n+1} (corollary 2-1).

From lemma 2-3, m →*_R t ⇔ t = t_1 ... t_{n+1} with w_i →*_R t_i. Therefore, if the rewriting is infinite, there
exists a w_i such that rewriting restricted to w_i is infinite. But we know that w_i ∈ (Σ' ∪ Q)*.(Σ''
∪ Q)*, so we can use the preceding case. □


Theorem 2-1 : Termination of length-preserving string rewriting systems is undecidable.


Proof : It suffices to verify that the construction which associates a string rewriting system R_φψ to
the linear-bounded automaton A_φψ reduces the termination problem in the class A_post to the termination
problem for length-preserving string rewriting systems. For that, we remark that proposition
1-2, lemma 1-3 and lemma 2-6 imply that if R_φψ does not terminate then neither does A_φψ. Indeed,
if R_φψ does not terminate, A_φψ passes through a proper initial configuration again, from which it
loops.

From lemma 2-5 we obtain the converse. □


Corollary 2-2 :
Termination of non-length-increasing string rewriting systems is undecidable.


Remark :

A_φψ could loop from a non-reachable configuration and stop from every initial
configuration. It would then terminate in the machine sense without ensuring termination
of R_φψ. Lemma 1-3 rules out this situation.


III - Undecidability of confluence of terminating rewriting systems on qA* 


In this section, we want to show that properties of linear-bounded automata cannot always be
carried over to non-length-increasing string rewriting systems. Indeed, a linear-bounded
automaton starts from an initial configuration, where the tape head is on the left side of the
tape. For a rewriting system, this amounts to working on words starting with a special letter
symbolizing the initial state of the linear-bounded automaton.


Definition 3-1 : [Huet] 
A rewriting system is confluent if for all u that reduces to two terms t and t’ there exists v such 
that t and t’ reduce to v. 


Theorem 3-1 : [Newman] 
Confluence is decidable for terminating rewriting systems. 


It is well known that confluence and ground confluence (confluence restricted to terms without 
variables) are not equivalent. In particular, ground confluence is undecidable for terminating 
term rewriting systems. Confluence implies ground confluence but the converse is false. 
Nevertheless, the following identification lemma shows that we can identify confluence of any 
semi-Thue system S with both confluence and ground confluence of the corresponding term 
rewriting system S’. 


Identification lemma :
Let S be a semi-Thue system over a (non ranked) alphabet Σ. We associate to S the term rewriting
system S' over the ranked alphabet Σ' = {a(x) | a ∈ Σ} ∪ {$} ($ is a constant):
S' = {l(x) → r(x) | l → r ∈ S}.
Then t →_S u ⇔ t(x) →_S' u(x) ⇔ t($) →_S' u($).


Proof : obvious. □


As a corollary, confluence of S, confluence of S' and ground confluence of S' coincide.

From now on, we use this identification and work only in the word case. We describe in this
paragraph a terminating rewriting system. This system is not confluent on words of A* and its
confluence is undecidable on qA*, q a fixed symbol of A.


Let A_φψ be a machine of the class A_post studied in part I. We associate to this machine an alphabet:
A = Q_φψ ∪ Σ ∪ {q, q_yes, Y, N}. We modify the machine A_φψ: we remove the last transitions,
which make the machine loop. Therefore, if the data has the form m̃m' with m' = φ(m) = ψ(m),
the machine goes to the configuration #q_end<m̃m'># and stops.

To this new automaton, we associate a rewriting system R1. It contains all the rules simulating the
transitions of the machine: a transition of the form (q,a) →_A (b,q',Le) is associated to the rule
h q a →_R1 q' h b, with a, b, h in A and q, q' in Q_φψ; a transition of the form (q,a) →_A (b,q',Ri) is
associated to the rule q a →_R1 b q', with a, b in A and q, q' in Q_φψ.

Moreover, R1 contains the rules:

q_end < →_R1 q_yes

∀ a ∈ A - {#}, q_yes a →_R1 q_yes

q_yes # →_R1 Y

q →_R1 < q0


We now consider the rewriting system R2:

∀ f ∈ A - {Y, N}, f →_R2 N

∀ a ∈ A, Y a →_R2 N

∀ a ∈ A, N a →_R2 N


R is the system consisting of the rules of R1 and R2. R is terminating.


Lemma 3-1 :
For m ∈ I*, m' ∈ X*, q m̃ m' ># →*_R Y if and only if φ(m) = ψ(m) = m'.


(we denote by m̃ the mirror image of m)


Proof:

⇐) If φ(m) = ψ(m) = m' then, according to part I, the machine A_φψ, from an initial configuration
#<q0 i_n...i_1 x_1...x_p>#, goes to the instantaneous description #q_end<i_n...i_1 x_1...x_p># with m = i_1...i_n
and m' = x_1...x_p. Hence with R we can go from the word q i_n...i_1 x_1...x_p># to the word
<q0 i_n...i_1 x_1...x_p># and to q_end<i_n...i_1 x_1...x_p>#. Applying the rule q_end < →_R1 q_yes we get
q_yes i_n...i_1 x_1...x_p>#. Applying several times rules of the form q_yes a →_R1 q_yes we get the word q_yes#.
Finally with the rule q_yes # →_R1 Y we get the word Y.

⇒) We suppose that q m̃ m'># →*_R Y with m = i_1...i_n and m' = x_1...x_p.

m̃ and m' contain neither Y, nor q_yes, nor any state of Q_φψ. There is only one way to
get Y: using the rule q_yes # →_R1 Y. Moreover, the only way to get q_yes is to apply the rule
q_end < →_R1 q_yes. To generate q_end, we have applied rules simulating A_φψ: q i_n...i_1 x_1...x_p># →_R
<q0 i_n...i_1 x_1...x_p># →*_R q_end<i_n...i_1 x_1...x_p>#.

But we have seen in paragraph I that #<q0 i_n...i_1 x_1...x_p># →*_A #q_end<i_n...i_1 x_1...x_p># if and only
if φ(i_1...i_n) = ψ(i_1...i_n) = x_1...x_p. □


Definition 3-2 :

A word w is a normal form if there exists no word v with w →_R v.

A word w has a normal form if w →*_R v for some normal form v.

The set of normal forms of a word m (also called irreducible forms) is denoted by IRR(m).


Lemma 3-2:
For all u in A+ - {Y}, N is in IRR(u).


Proof:

- If u starts neither with N nor with Y: applying the rule f →_R2 N and possibly the rules
of the form N a →_R2 N, we obtain the reduction u →*_R N. So N is in IRR(u).

- If u starts with N : u = N.w

If w = ε then u = N and so u is irreducible. Hence N is in IRR(u). Else we apply several times
rules N a →_R2 N and we get N in IRR(u).

- If u starts with Y : u = Y.w with w ≠ ε. Hence we can apply a rule Y a →_R2 N and possibly rules
N a →_R2 N. Therefore N is in IRR(u). □
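
The case analysis of this proof can be checked mechanically on small words. The sketch below explores only the rules of R2 (the rules of R1 are deliberately left out) and assumes that a word is a tuple of letters, each letter being a Python string, with "Y" and "N" standing for the two special letters. The function names are ours.

```python
def r2_successors(word):
    """All words reachable from `word` (a tuple of letters) in one step of R2."""
    out = set()
    for k, letter in enumerate(word):
        if letter not in ("Y", "N"):
            # f -> N for every letter f other than Y and N
            out.add(word[:k] + ("N",) + word[k + 1:])
        elif k + 1 < len(word):
            # Y a -> N and N a -> N for every letter a
            out.add(word[:k] + ("N",) + word[k + 2:])
    return out

def irr_r2(word):
    """Set of normal forms of `word` under R2, found by exhaustive search."""
    seen, todo, normal = {word}, [word], set()
    while todo:
        current = todo.pop()
        successors = r2_successors(current)
        if not successors:
            normal.add(current)
        for nxt in successors - seen:
            seen.add(nxt)
            todo.append(nxt)
    return normal
```

The search terminates because every step of R2 either shortens the word or decreases the number of letters other than Y and N. Under R2 alone, every non-empty word other than Y has N as its only normal form.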


Lemma 3-3 :
∀ m ∈ A*, IRR(m) ⊂ {Y,N,ε}. ∀ m ∈ A+, IRR(m) ⊂ {Y,N}.


Proof :
Suppose that there exists u ∈ IRR(m), u ∉ {Y,N,ε}. Then, from lemma 3-2, u can be reduced to
N. So u is not a normal form. Moreover, it is obvious that ε ∈ IRR(m) if and only if m = ε. □


Lemma 3-4 :

Let qw be a word in qA*.

If qw has the form q m̃ m' ># then

i) IRR(qw) = {Y,N} iff φ(m) = ψ(m) = m'

ii) IRR(qw) = {N} iff φ(m) ≠ m' or ψ(m) ≠ m'

Else IRR(qw) = {N}.


Proof:

1) qw has the form q m̃ m'>#

i) According to lemma 3-1, q m̃ m'># →*_R Y if and only if φ(m) = ψ(m) = m'.

Moreover q m̃ m'># →*_R N from lemma 3-2. Hence {Y,N} ⊂ IRR(qw) if and only if φ(m) =
ψ(m) = m'. Lemma 3-3 shows that IRR(qw) = {Y,N} iff φ(m) = ψ(m) = m'.

ii) φ(m) ≠ m' or ψ(m) ≠ m' ⇔ Y ∉ IRR(qw). But q m̃ m'># →*_R N. Hence IRR(qw) = {N}.

2) qw does not have the form q m̃ m'>#. Then qw does not correspond to a proper initial configuration
of A_φψ. Hence we apply rules of R without obtaining a word of the form q_end<m̃m'>#. Therefore Y ∉ IRR(qw).
But qw →*_R N. Hence IRR(qw) = {N}. □


Theorem 3-2 : q is a fixed symbol of a finite alphabet A. The following problem is undecidable.
Instance : R semi-Thue, non-length-increasing, terminating, not confluent.
Question : is R confluent on qA* ?


Proof : According to lemma 3-4, R is divergent on qA* if and only if there exists m such that
φ(m) = ψ(m). Hence deciding confluence of R on qA* is equivalent to deciding the Post problem
P(φ,ψ). Consequently it is undecidable. □


Using the identification lemma, we get the corollary:
Corollary 3-1 : The following problem is undecidable.
Instance : R a terminating rewriting system, S a sort.
Question : is R confluent on S ?


Remark: S can be chosen very simple. For example, S is the set of terms of root q.

Appendix I : Definition of a machine A_φψ


In the transitions, for i in I, φ(i)_j represents the j-th letter of φ(i), with
φ(i) = φ(i)_1...φ(i)_{k_i}. We use the same notation for ψ: ψ(i)_j represents the j-th letter of ψ(i), with
ψ(i) = ψ(i)_1...ψ(i)_{l_i}.

Q_φψ = {q0, q1, q2, q_ret, q_reset, q_again, q'_ret, q'_reset, q_end} ∪ {q_φ(i)j | i ∈ I and φ(i) = φ(i)_1...φ(i)_{k_i},
j ∈ [1,k_i] if φ(i) ≠ ε} ∪ {q_φ(i)0 if φ(i) = ε} ∪ {q_ψ(i)j | i ∈ I and ψ(i) = ψ(i)_1...ψ(i)_{l_i}, j ∈ [1,l_i] if
ψ(i) ≠ ε} ∪ {q_ψ(i)0 if ψ(i) = ε}.

. The tape is of the form #<m>#, where m is a word in (I ∪ X)*. Initially, the tape head is on the first letter of
m, and the machine state is q0.

. Δ is the next-move function:

A transition (q,a) → (b,q',Dir) means that the head reads the letter a, the state of the automaton
is q, and after applying the transition, the letter replacing a is b, the new state is q' and the head
moves in the direction Dir (Ri for right, Le for left).

∀ i, i' ∈ I and ∀ x ∈ X,

a) rules of equality verification: m2 = φ(m1).

(q0, i) → (i, q1, Ri) % search for i on the right side
(q1, i) → (i, q1, Ri)
(q1, x) → (x, q2, Le)
(q2, i) → (ī, q_φ(i)1, Ri) % search for its image by φ if it is not ε
(q_φ(i)1, ī') → (ī', q_φ(i)1, Ri)
(q_φ(i)1, x̄) → (x̄, q_φ(i)1, Ri)
(q_φ(i)j, x) → (x̄, q_φ(i)j+1, Ri) % with x = φ(i)_j, j < k_i
(q_φ(i)k_i, x) → (x̄, q_ret, Le) % with x = φ(i)_{k_i}; we do it again for the next i

(q_ret, x̄) → (x̄, q_ret, Le)
(q_ret, ī) → (ī, q_ret, Le)
(q_ret, i) → (ī, q_φ(i)1, Ri) % if the image of i by φ is not ε
(q_ret, i) → (ī, q_φ(i)0, Le) % if the image of i by φ is ε
(q_ret, <) → (<, q_reset, Ri)

(q2, i) → (ī, q_φ(i)0, Le) % if the image of i by φ is ε
(q_φ(i)0, i') → (ī', q_φ(i')1, Ri) % if the image of i' by φ is not ε
(q_φ(i)0, i') → (ī', q_φ(i')0, Le) % if the image of i' by φ is ε
(q_φ(i)0, <) → (<, q_reset, Ri)


b) rules of tape reset.

(q_reset, ī) → (i, q_reset, Ri)
(q_reset, x̄) → (x, q_reset, Ri)
(q_reset, >) → (>, q_again, Le)
(q_again, x) → (x, q_again, Le)


c) rules of equality verification: m2 = ψ(m1).

(q_again, i) → (ī, q_ψ(i)1, Ri) % if the image of i by ψ is not ε
(q_ψ(i)1, ī') → (ī', q_ψ(i)1, Ri)
(q_ψ(i)1, x̄) → (x̄, q_ψ(i)1, Ri)
(q_ψ(i)j, x) → (x̄, q_ψ(i)j+1, Ri) % with x = ψ(i)_j, j < l_i
(q_ψ(i)l_i, x) → (x̄, q'_ret, Le) % with x = ψ(i)_{l_i}

(q'_ret, x̄) → (x̄, q'_ret, Le)
(q'_ret, ī) → (ī, q'_ret, Le)
(q'_ret, i) → (ī, q_ψ(i)1, Ri) % if the image of i by ψ is not ε
(q'_ret, i) → (ī, q_ψ(i)0, Le) % if the image of i by ψ is ε
(q'_ret, <) → (<, q'_reset, Ri)

(q_again, i) → (ī, q_ψ(i)0, Le) % if the image of i by ψ is ε
(q_ψ(i)0, i') → (ī', q_ψ(i')1, Ri) % if the image of i' by ψ is not ε
(q_ψ(i)0, i') → (ī', q_ψ(i')0, Le) % if the image of i' by ψ is ε
(q_ψ(i)0, <) → (<, q'_reset, Ri)


d) rules of restoration of the initial configuration.

(q'_reset, ī) → (i, q'_reset, Ri)
(q'_reset, x̄) → (x, q'_reset, Ri)
(q'_reset, >) → (>, q_end, Le)
(q_end, x) → (x, q_end, Le)
(q_end, i) → (i, q_end, Le)
(q_end, <) → (<, q0, Ri)


Appendix II : Proof of lemma 1-3 


Let ID be a configuration such that A_φψ loops from it.

First case: ID is reachable.

If ID is reachable, there exists an initial configuration from which the machine has reached ID.
Therefore, there exists a beginning of computation which loops. The machine A_φψ loops if
and only if P(φ,ψ) is verified. Moreover, when the Post property is verified, the machine loops,
passing through a proper initial configuration again. Hence there exists a beginning of computation
starting from a proper initial configuration which loops.

Second case: ID is not reachable.

Let us consider the dependence graph between the states of A_φψ.


We write Q_φ the set of states q_φ(i)j and Q_ψ the set of states q_ψ(i)j.
1) The machine does not loop in one state, nor in a set Q_φ or Q_ψ.
- For all states of A_φψ, when the machine applies a transition staying in one state, the head
always goes in the same direction. But the tape is finite, on the right and on the left. Hence the
machine cannot stay in the same state doing an infinite number of computation steps.

- We consider the set Q_φ.

We stay in a state q_φ(i)j if and only if j = 1.

* if j ≥ 1 then we go to state q_φ(i)j+1. Since j has a finite number of possible values, the machine
does a finite number of computation steps staying in Q_φ.

* if j = 0 then we replace a letter of I by a letter of Ī, staying in a state of Q_φ. But no transition
in a state of Q_φ transforms a letter of Ī into a letter of I. Therefore the number of letters of I on the
tape decreases. Moreover, if there is no letter of I on the tape, we cannot apply a transition being
in q_φ(i)0 and staying in a state of Q_φ.

Hence, we do a finite number of computation steps, staying in Q_φ.

- We can do the same proof for Q_ψ.

2) The machine does not loop on Q_φ and q_ret.

When it passes from Q_φ to q_ret, at least one letter of the tape has been overlined. Moreover, it
passes from q_ret to Q_φ overlining one letter of the tape. Since there is a finite number of letters
on the tape, there is a finite number of computation steps. The same argument holds for
Q_ψ and q'_ret.

3) From 1) and 2) we deduce that the machine loops passing through q0, q1, q2, q_ret, q_reset, q_again, q'_ret,
q'_reset, q_end and through the sets Q_φ and Q_ψ. But there is only one way to pass through q0: using the
transition (q_end, <) → (<, q0, Ri). Therefore, the machine is in an instantaneous description of the form
#<q0 m>#, which is, by definition, an initial configuration. Finally, according to the preceding
lemmas, if the machine loops from an initial configuration, then it is a proper one. □


